AITopics | Gladwin County

Collaborating Authors

Gladwin County

Contextual Document Embeddings

arXiv.org Artificial IntelligenceNov-8-2024

Dense document embeddings are central to neural retrieval. The dominant paradigm is to train and construct embeddings by running encoders directly on individual documents. In this work, we argue that these embeddings, while effective, are implicitly out-of-context for targeted use cases of retrieval, and that a contextualized document embedding should take into account both the document and neighboring documents in context - analogous to contextualized word embeddings. We propose two complementary methods for contextualized document embeddings: first, an alternative contrastive learning objective that explicitly incorporates the document neighbors into the intra-batch contextual loss; second, a new contextual architecture that explicitly encodes neighbor document information into the encoded representation. Results show that both methods achieve better performance than biencoders in several settings, with differences especially pronounced out-of-domain. We achieve state-of-the-art results on the MTEB benchmark with no hard negative mining, score distillation, dataset-specific instructions, intra-GPU example-sharing, or extremely large batch sizes. Our method can be applied to improve performance on any contrastive learning dataset and any biencoder.

information retrieval, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2410.02525

Country:

North America > United States > Montana > Flathead County (0.04)
North America > United States > Michigan > Iosco County (0.04)
North America > United States > California (0.04)
(16 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Law (1.00)
Leisure & Entertainment > Sports > Football (0.67)
Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

RATE: Score Reward Models with Imperfect Rewrites of Rewrites

Reber, David, Richardson, Sean, Nief, Todd, Garbacea, Cristina, Veitch, Victor

arXiv.org Artificial IntelligenceOct-15-2024

This paper concerns the evaluation of reward models used in language modeling. A reward model is a function that takes a prompt and a response and assigns a score indicating how good that response is for the prompt. A key challenge is that reward models are usually imperfect proxies for actual preferences. For example, we may worry that a model trained to reward helpfulness learns to instead prefer longer responses. In this paper, we develop an evaluation method, RATE (Rewrite-based Attribute Treatment Estimators), that allows us to measure the causal effect of a given attribute of a response (e.g., length) on the reward assigned to that response. The core idea is to use large language models to rewrite responses to produce imperfect counterfactuals, and to adjust for rewriting error by rewriting twice. We show that the RATE estimator is consistent under reasonable assumptions. We demonstrate the effectiveness of RATE on synthetic and real-world data, showing that it can accurately estimate the effect of a given attribute on the reward model.

causal explainability, reward model, rewrite, (13 more...)

arXiv.org Artificial Intelligence

2410.11348

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Indiana > Marion County > Indianapolis (0.04)
Asia > India (0.04)
(8 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Leisure & Entertainment (1.00)
Health & Medicine (1.00)
Media > Film (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback